A computational tool for the genomic identification of regions of unusual compositional properties and its utilization in the detection of horizontally transferred sequences.

Authors

  • Catherine Putonti
  • Yi Luo
  • Charles Katili
  • Sergey Chumakov
  • George E Fox
  • Dan Graur
  • Yuriy Fofanov
Abstract

Similarity Plot (S-plot) is a Windows-based application for large-scale comparisons and 2-dimensional visualization of compositional similarities between genomic sequences. This application combines 2 approaches widely used in genomics: window analysis of statistical characteristics along genomes and dot-plot visual representation. S-plot is effective in identifying highly similar regions between genomes as well as regions with unusual compositional properties (RUCPs) within a single genome, which may be indicative of horizontal gene transfer or of locus-specific selective forces. We use S-plot to identify regions that may have originated through horizontal gene transfer through a 2-step approach, by first comparing a genomic sequence to itself and, subsequently, comparing it to the genomic sequence of a closely related taxon. Moreover, by comparing these suspect sequences to one another, we can estimate a minimum number of sources for these putative xenologous sequences. We illustrate the uses of S-plot in a comparison involving Escherichia coli K12 and E. coli O157:H7. In O157:H7, we found 145 regions that have most probably originated through horizontal gene transfer. By using S-plot to compare each of these regions with 277 completely sequenced prokaryotic genomes, 1 sequence was found to have similar compositional properties to the Yersinia pseudotuberculosis genome, indicating a transfer from a Yersinia or Yersinia relative. Based upon our analysis of RUCPs in O157:H7, we infer that there were at least 53 sources of horizontally transferred sequences.
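The combination of window analysis and dot-plot style comparison described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state which compositional statistic or similarity measure S-plot uses, so everything here is an assumption made for illustration: the k-mer frequency profile, the Pearson correlation, the function names, and the `threshold` value are hypothetical and are not taken from the S-plot application itself.

```python
from collections import Counter
from itertools import product
import math


def kmer_profile(seq, k=2):
    """Relative frequencies of all 4**k DNA k-mers in seq (assumed statistic)."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts[m] for m in kmers)  # k-mers with non-ACGT symbols are ignored
    return [counts[m] / total if total else 0.0 for m in kmers]


def pearson(x, y):
    """Pearson correlation of two equal-length vectors; 0.0 if either is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def window_profiles(seq, window, step, k=2):
    """k-mer profile of every full-length window along seq."""
    return [kmer_profile(seq[s:s + window], k)
            for s in range(0, len(seq) - window + 1, step)]


def similarity_matrix(seq_a, seq_b, window, step, k=2):
    """2-D matrix: entry [i][j] compares window i of seq_a with window j of seq_b."""
    pa = window_profiles(seq_a, window, step, k)
    pb = window_profiles(seq_b, window, step, k)
    return [[pearson(a, b) for b in pb] for a in pa]


def unusual_windows(seq, window, step, k=2, threshold=0.5):
    """Indices of windows whose mean similarity to the other windows of the
    same sequence falls below threshold (a hypothetical cutoff)."""
    m = similarity_matrix(seq, seq, window, step, k)
    flagged = []
    for i, row in enumerate(m):
        others = [v for j, v in enumerate(row) if j != i]
        if others and sum(others) / len(others) < threshold:
            flagged.append(i)
    return flagged
```

Under these assumptions, the first step of the 2-step approach corresponds to calling `unusual_windows` on a single genome, and the second step to calling `similarity_matrix` with the genome of a closely related taxon as `seq_b` and checking whether the flagged windows have a compositionally similar counterpart there.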

Similar resources

A Simple Genome Walking Strategy to Isolate Unknown Genomic Regions Using Long Primer and RAPD Primer

Background: Genome walking is a DNA-cloning methodology that is used to isolate unknown genomic regions adjacent to known sequences. However, the existing genome-walking methods have their own limitations. Objectives: Our aim was to provide a simple and efficient genome-walking technology. Material and Methods: In this paper, we dev...


Regions of Unusual Statistical Properties as Tools in the Search for Horizontally Transferred Genes in Escherichia coli

The observed diversity of statistical characteristics along genomic sequences is the result of the influences of a variety of ongoing processes including horizontal gene transfer, gene loss, genome rearrangements, and evolution. The rate at which various processes affect the genome typically varies between different genomic regions. Thus, variations in statistical properties seen in different r...


Serological and genomic detection of bovine leukemia virus in human and cattle samples

Bovine leukemia virus (BLV) is a retrovirus responsible for lymphoproliferative disorders in cattle. Although infections of BLV in animals are well known, little is known about its capacity to infect humans. This study investigated the presence of anti-BLV antibodies and BLV proviruses in human and cattle samples. An indirect enzyme-linked immunosorbent assay (ELISA) was used to detect anti-BL...


IGIPT - Integrated genomic island prediction tool

IGIPT is a web-based integrated platform for the identification of genomic islands (GIs). It incorporates thirteen parametric measures based on anomalous nucleotide composition on a single platform, thus improving the predictive power of a horizontally acquired region, since it is known that no single measure can absolutely predict a horizontally transferred region. The tool filters ...

Journal:
  • Molecular biology and evolution

Volume 23, Issue 10

Pages: -

Publication date: 2006